Overview

Dataset statistics

Number of variables 122
Number of observations 95115
Missing cells 5865966
Missing cells (%) 50.6%
Duplicate rows 0
Duplicate rows (%) 0.0%
Total size in memory 88.5 MiB
Average record size in memory 976.0 B
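
The percentage of missing cells and the average record size follow from the other figures in this table. The sketch below recomputes them as a sanity check; it assumes that "cells" means variables times observations and that 1 MiB is 1,048,576 bytes. The variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 122
n_observations = 95115
missing_cells = 5865966
total_size_mib = 88.5

# Assumption: a "cell" is one variable value for one observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 1024 * 1024 bytes; the input size is itself rounded.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: about {avg_record_bytes:.0f} B")
```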

Variable types

CAT 60
BOOL 44
NUM 14
DATE 3
UNSUPPORTED 1

Warnings

APPLICATION_COMMON_APP_ID has a high cardinality: 94944 distinct values High cardinality
PERSON_REFERENCE_ID has a high cardinality: 95028 distinct values High cardinality
PERSON_CU_SIS_ID has a high cardinality: 94992 distinct values High cardinality
PERSON_BIRTHDATE has a high cardinality: 1873 distinct values High cardinality
ADDRESS_STREET_COMBINED has a high cardinality: 91588 distinct values High cardinality
ADDRESS_CITY has a high cardinality: 6135 distinct values High cardinality
ADDRESS_COUNTY has a high cardinality: 1048 distinct values High cardinality
ADDRESS_REGION has a high cardinality: 683 distinct values High cardinality
ADDRESS_POSTAL has a high cardinality: 85693 distinct values High cardinality
ADDRESS_US_5_DIGIT_ZIP_CODE has a high cardinality: 8829 distinct values High cardinality
ADDRESS_COUNTRY has a high cardinality: 119 distinct values High cardinality
PERSON_NATIVE_LANGUAGE has a high cardinality: 53 distinct values High cardinality
PERSON_HS_GPA_SR has a high cardinality: 8559 distinct values High cardinality
1_HIGH_SCHOOL_CEEB_CODE has a high cardinality: 9661 distinct values High cardinality
1_HIGH_SCHOOL_NAME has a high cardinality: 13440 distinct values High cardinality
1_HIGH_SCHOOL_REGION has a high cardinality: 263 distinct values High cardinality
1_HIGH_SCHOOL_CITY has a high cardinality: 6651 distinct values High cardinality
APPLICATION_ORIGINAL_ACADEMIC_INTEREST has a high cardinality: 84 distinct values High cardinality
APPLICATION_SUBMITTED_DATE has a high cardinality: 859 distinct values High cardinality
ADMIT_DECISION_RELEASED_DATE has a high cardinality: 557 distinct values High cardinality
ADMIT_DECISION_RECEIVED_DATE has a high cardinality: 28071 distinct values High cardinality
DEPOSIT_DECISION_CONFIRMED_DATE has a high cardinality: 12437 distinct values High cardinality
PAST_APP_HISTORY has a high cardinality: 328 distinct values High cardinality
MOST_RECENT_ON_CAMPUS_EVENT has a high cardinality: 82 distinct values High cardinality
HS_CITY_LOCATION has a high cardinality: 3170 distinct values High cardinality
HS_NCES_ID has a high cardinality: 6674 distinct values High cardinality
HS_NAME has a high cardinality: 6179 distinct values High cardinality
HS_STATE has a high cardinality: 52 distinct values High cardinality
HS_ZIP has a high cardinality: 4880 distinct values High cardinality
HS_ADDRESS has a high cardinality: 5482 distinct values High cardinality
HS_CEEB has a high cardinality: 6691 distinct values High cardinality
PERSON_MULTIPLE_APPLICATIONS has 93963 (98.8%) missing values Missing
ETHNICITY has 1304 (1.4%) missing values Missing
ADDRESS_COUNTY has 2931 (3.1%) missing values Missing
PERSON_HOME_SCHOOL has 95019 (99.9%) missing values Missing
PERSON_ACT_MAX_COMPOSITE has 48103 (50.6%) missing values Missing
PERSON_SATI_MAX_TOTAL has 94977 (99.9%) missing values Missing
PERSON_SATR_MAX_COMPS has 33220 (34.9%) missing values Missing
PERSON_24_OR_MORE_HOURS_COLLEGE_WORK has 93438 (98.2%) missing values Missing
PERSON_APPLICANT_FIRST_GENERATION has 82644 (86.9%) missing values Missing
PARENT_CU_ATTENDANCE has 94732 (99.6%) missing values Missing
PARENT_CU_EMPLOYMENT has 94602 (99.5%) missing values Missing
PERSON_ENGLISH_NATIVE_LANGUAGE has 93605 (98.4%) missing values Missing
PERSON_NATIVE_LANGUAGE has 1896 (2.0%) missing values Missing
PERSON_GROSS_FAMILY_INCOME has 31492 (33.1%) missing values Missing
PERSON_FAFSA_SUBMITTED has 33655 (35.4%) missing values Missing
PERSON_MILITARY_STATUS has 94984 (99.9%) missing values Missing
PERSON_HS_RANK_PERCENTILE has 60005 (63.1%) missing values Missing
PERSON_HS_EQUIVALENCY has 95008 (99.9%) missing values Missing
PERSON_CUMULATIVE_GPA has 86235 (90.7%) missing values Missing
PERSON_HS_GPA_SR has 29472 (31.0%) missing values Missing
1_HIGH_SCHOOL_REGION has 2090 (2.2%) missing values Missing
1_HIGH_SCHOOL_HONORS has 95115 (100.0%) missing values Missing
CUB_IS__1_CHOICE has 67074 (70.5%) missing values Missing
APPLICATION_CU_SIS_APP_MATRICULATED has 20344 (21.4%) missing values Missing
ADMIT_DECISION_RECEIVED_DATE has 14935 (15.7%) missing values Missing
DEPOSIT_DECISION_CONFIRMED_DATE has 71523 (75.2%) missing values Missing
APPLICATION_HOUSING_APPLICATION_COMPLETED has 74462 (78.3%) missing values Missing
PAST_APP_HISTORY has 93941 (98.8%) missing values Missing
PERSON_ENGAGEMENT has 53021 (55.7%) missing values Missing
MOST_RECENT_ON_CAMPUS_EVENT has 67817 (71.3%) missing values Missing
DATE_MOST_RECENT_ON_CAMPUS_EVENT has 67817 (71.3%) missing values Missing
DEPOSIT_PAID_CREATED_DATE has 71523 (75.2%) missing values Missing
CURRENT_BIN_NAME has 57359 (60.3%) missing values Missing
RELATION_LEGAL_GUARDIAN_HIGHEST_EDUCATION has 95101 (> 99.9%) missing values Missing
RELATION_LEGAL_GUARDIAN_RELATIONSHIP_CU_ATTENDANCE has 95114 (> 99.9%) missing values Missing
RELATION_LEGAL_GUARDIAN_RELATIONSHIP_CU_EMPLOYMENT has 95114 (> 99.9%) missing values Missing
RELATION_SIBLING_HIGHEST_EDUCATION has 95011 (99.9%) missing values Missing
RELATION_SIBLING_RELATIONSHIP_CU_EMPLOYMENT has 95002 (99.9%) missing values Missing
RELATION_SIBLING_RELATIONSHIP_CU_ATTENDANCE has 94992 (99.9%) missing values Missing
RELATION_STEP_PARENT_HIGHEST_EDUCATION has 95094 (> 99.9%) missing values Missing
RELATION_STEP_PARENT_RELATIONSHIP_CU_ATTENDANCE has 95094 (> 99.9%) missing values Missing
RELATION_STEP_PARENT_RELATIONSHIP_CU_EMPLOYMENT has 95094 (> 99.9%) missing values Missing
RELATION_GRANDPARENT_HIGHEST_EDUCATION has 95105 (> 99.9%) missing values Missing
RELATION_GRANDPARENT_RELATIONSHIP_CU_EMPLOYMENT has 95106 (> 99.9%) missing values Missing
RELATION_GRANDPARENT_RELATIONSHIP_CU_ATTENDANCE has 95105 (> 99.9%) missing values Missing
APPLICATION_FEE_WAIVER___SCHOOL_SPECIFIC has 83799 (88.1%) missing values Missing
APPLICATION_ASSET_ELIGIBLE_WITH_HIGH_SCHOOL_TRANSCRIPT has 95078 (> 99.9%) missing values Missing
APPLICATION_ASSET_ELIGIBLE_WITHOUT_HIGH_SCHOOL_TRANSCRIPT has 94977 (99.9%) missing values Missing
APPLICATION_SCHOOL_SPECIFIC_FEE_WAIVER__COMMON_APP_NON_FINANCIAL has 67504 (71.0%) missing values Missing
PERSON_YEARS_LIVED_IN_U_S has 93294 (98.1%) missing values Missing
PERSON_YEARS_LIVED_OUTSIDE_U_S has 34832 (36.6%) missing values Missing
APPLICATION_ARTS___HUMANITIES_INSTATE_SCHOLAR has 94896 (99.8%) missing values Missing
APPLICATION_ARTS___HUMANITIES_OUT_OF_STATE_SCHOLAR has 94038 (98.9%) missing values Missing
APPLICATION_BAKER_SCHOLAR has 90763 (95.4%) missing values Missing
APPLICATION_CHANCELLOR_SCHOLAR has 71674 (75.4%) missing values Missing
APPLICATION_DEAN_SCHOLAR has 93802 (98.6%) missing values Missing
APPLICATION_HALE_SCHOLAR has 93063 (97.8%) missing values Missing
APPLICATION_IMPACT_SCHOLARSHIP has 94703 (99.6%) missing values Missing
APPLICATION_LEEDS_SCHOLAR_ADMIT has 94868 (99.7%) missing values Missing
APPLICATION_LEEDS_SCHOLAR_APPLICANT has 94885 (99.8%) missing values Missing
APPLICATION_MUSIC_SCHOLARSHIP has 95114 (> 99.9%) missing values Missing
APPLICATION_PRESIDENTIAL_SCHOLAR has 91753 (96.5%) missing values Missing
APPLICATION_REGENT_SCHOLARSHIP has 94131 (99.0%) missing values Missing
APPLICATION_SEWALL_SCHOLAR has 92198 (96.9%) missing values Missing
APPLICATION_EXCEL has 93729 (98.5%) missing values Missing
PERSON_TUITION_CLASSIFICATION has 67510 (71.0%) missing values Missing
APPLICATION_ADMITTED_FROM_WAITLIST has 93820 (98.6%) missing values Missing
PERSON_PRE_COLLEGIATE_BOULDER_SUMMER_STUDENTS has 94860 (99.7%) missing values Missing
PERSON_PRE_COLLEGIATE_RURAL has 95091 (> 99.9%) missing values Missing
PERSON_PRE_COLLEGIATE_WESTERN_SLOPE has 94968 (99.8%) missing values Missing
PERSON_PRE_COLLEGIATE_SYSTEM_STUDENTS has 94898 (99.8%) missing values Missing
PERSON_DANIEL_S_SCHOLAR has 94925 (99.8%) missing values Missing
PERSON_ENGINEARME has 95086 (> 99.9%) missing values Missing
HS_CITY_LOCATION has 25798 (27.1%) missing values Missing
HS_TOT_ENROLLMENT has 12240 (12.9%) missing values Missing
HS_NCES_ID has 12234 (12.9%) missing values Missing
HS_NAME has 12234 (12.9%) missing values Missing
HS_TYPE has 12234 (12.9%) missing values Missing
HS_STATE has 25798 (27.1%) missing values Missing
HS_TEACHERS_FTE has 12403 (13.0%) missing values Missing
HS_URBAN_CENTRIC_LOCALE_CODE has 12234 (12.9%) missing values Missing
HS_ZIP has 25798 (27.1%) missing values Missing
HS_ADDRESS has 25798 (27.1%) missing values Missing
HS_TYPE_VALUE has 12234 (12.9%) missing values Missing
HS_CLASSIFICATION has 12234 (12.9%) missing values Missing
HS_CEEB has 12234 (12.9%) missing values Missing
1_HIGH_SCHOOL_HONORS is an unsupported type, check if it needs cleaning or further analysis Unsupported
TOTAL_PING_TIME__SEC has 9336 (9.8%) zeros Zeros
VIRTUAL_EVENT_ENGAGEMENT_COUNT has 94294 (99.1%) zeros Zeros
ON_CAMPUS_ENGAGEMENT_COUNT has 67817 (71.3%) zeros Zeros
OFF_CAMPUS_ENGAGEMENT_COUNT has 82972 (87.2%) zeros Zeros

Reproduction

Analysis started 2020-10-28 19:33:37.773733
Analysis finished 2020-10-28 19:33:47.313935
Duration 9.54 seconds
Software version pandas-profiling v2.9.0
Download configuration config.yaml
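
A report of this kind can be regenerated roughly as sketched below. This is a minimal sketch, assuming pandas and pandas-profiling v2.9.0 are installed and that the data is available as a CSV file. The function name, the file paths and the report title are hypothetical, and the settings stored in config.yaml are not reproduced here.

```python
def build_profile(csv_path, html_path):
    """Profile a CSV file and write the HTML report to html_path.

    Both paths are hypothetical placeholders; the original data source
    is not named in the report.
    """
    # Imported lazily so the sketch can be loaded without the libraries installed.
    import pandas as pd
    from pandas_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    # Default settings; the original run may have used a custom config.yaml.
    report = ProfileReport(df, title="Applicant data profile")
    report.to_file(html_path)
    return report
```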

Variables

YEAR
Real number (ℝ≥0)

Distinct 5
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 2019.124397
Minimum 2018
Maximum 2022
Zeros 0
Zeros (%) 0.0%
Memory size 743.1 KiB

Quantile statistics

Minimum 2018
5-th percentile 2018
Q1 2018
median 2019
Q3 2020
95-th percentile 2020
Maximum 2022
Range 4
Interquartile range (IQR) 2

Descriptive statistics

Standard deviation 0.8425475792
Coefficient of variation (CV) 0.0004172836407
Kurtosis -1.300583467
Mean 2019.124397
Median Absolute Deviation (MAD) 1
Skewness -0.09789384025
Sum 192049017
Variance 0.7098864233
Monotonicity Not monotonic
Histogram with fixed size bins (bins=5)
Value Count Frequency (%)  
2020 36428 38.3%
 
2019 30109 31.7%
 
2018 27251 28.7%
 
2021 1326 1.4%
 
2022 1 < 0.1%
 
Value Count Frequency (%)  
2018 27251 28.7%
 
2019 30109 31.7%
 
2020 36428 38.3%
 
2021 1326 1.4%
 
2022 1 < 0.1%
 
Value Count Frequency (%)  
2022 1 < 0.1%
 
2021 1326 1.4%
 
2020 36428 38.3%
 
2019 30109 31.7%
 
2018 27251 28.7%
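
The descriptive statistics for YEAR can be recomputed from its frequency table. The sketch below is a cross-check using only the counts listed above; it assumes the report uses the sample variance (dividing by n - 1), which is the pandas default but is not stated in the report itself.

```python
import math

# Value counts copied from the YEAR frequency table above.
year_counts = {2018: 27251, 2019: 30109, 2020: 36428, 2021: 1326, 2022: 1}

n = sum(year_counts.values())
total = sum(value * count for value, count in year_counts.items())
mean = total / n

# Assumption: sample variance (n - 1 in the denominator), as in pandas.
variance = sum(
    count * (value - mean) ** 2 for value, count in year_counts.items()
) / (n - 1)
std = math.sqrt(variance)

# Coefficient of variation is the standard deviation divided by the mean.
cv = std / mean

print(n, total)
print(round(mean, 6), round(variance, 6), round(cv, 9))
```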
 

APPLICATION_COMMON_APP_ID
Categorical

HIGH CARDINALITY

Distinct 94944
Distinct (%) 100.0%
Missing 171
Missing (%) 0.2%
Memory size 743.1 KiB
Value Count Frequency (%)  
17360449-2018 1 < 0.1%
 
19401549-2018 1 < 0.1%
 
21414448-2019 1 < 0.1%
 
22867830-2020 1 < 0.1%
 
20256387-2019 1 < 0.1%
 
24406463-2020 1 < 0.1%
 
22418107-2019 1 < 0.1%
 
20547442-2019 1 < 0.1%
 
18931316-2018 1 < 0.1%
 
24954644-2020 1 < 0.1%
 
Other values (94934) 94934 99.8%
 
(Missing) 171 0.2%
 
Frequencies of value counts

Unique

Unique 94944
Unique (%) 100.0%

PERSON_REFERENCE_ID
Categorical

HIGH CARDINALITY

Distinct 95028
Distinct (%) 99.9%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
Value Count Frequency (%)  
481696362 2 < 0.1%
 
727223977 2 < 0.1%
 
370203402 2 < 0.1%
 
298165589 2 < 0.1%
 
137521878 2 < 0.1%
 
5205839 2 < 0.1%
 
609939978 2 < 0.1%
 
305979617 2 < 0.1%
 
183885516 2 < 0.1%
 
937282944 2 < 0.1%
 
Other values (95018) 95095 > 99.9%
 
Frequencies of value counts

Unique

Unique 94941
Unique (%) 99.8%

PERSON_CU_SIS_ID
Categorical

HIGH CARDINALITY

Distinct 94992
Distinct (%) 99.9%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
Value Count Frequency (%)  
nan 37 < 0.1%
 
109603673.0 2 < 0.1%
 
109613756.0 2 < 0.1%
 
108742279.0 2 < 0.1%
 
107350533.0 2 < 0.1%
 
109503799.0 2 < 0.1%
 
109266883.0 2 < 0.1%
 
109526536.0 2 < 0.1%
 
109491382.0 2 < 0.1%
 
108761464.0 2 < 0.1%
 
Other values (94982) 95060 99.9%
 
Frequencies of value counts

Unique

Unique 94904
Unique (%) 99.8%
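
PERSON_CU_SIS_ID is reported with 0 missing values, yet the literal value "nan" occurs 37 times and the IDs carry a trailing ".0". One plausible explanation, not confirmed by the report, is that a float column containing NaN was cast to text before profiling. The sketch below illustrates that effect and one possible way to avoid it; the helper name id_to_text and the sample list are hypothetical.

```python
import math


def id_to_text(value):
    """Render a numeric ID as text, mapping NaN to None instead of 'nan'."""
    if value is None or (isinstance(value, float) and math.isnan(value)):
        return None
    return str(int(value))


# Two IDs from the table above plus one missing value.
raw_ids = [109603673.0, float("nan"), 108742279.0]

# Naive cast: NaN becomes the string 'nan' and IDs keep the '.0' suffix.
naive = [str(v) for v in raw_ids]

# Cleaned cast: missing stays missing and IDs are rendered as integers.
cleaned = [id_to_text(v) for v in raw_ids]

print(naive)
print(cleaned)
```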
Distinct 13
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
Value Count Frequency (%)  
2020 Fall 36112 38.0%
 
2019 Fall 29882 31.4%
 
2018 Fall 27078 28.5%
 
2021 Fall 953 1.0%
 
2021 Spr 362 0.4%
 
2020 Spr 181 0.2%
 
2020 Sum 135 0.1%
 
2019 Spr 121 0.1%
 
2018 Sum 117 0.1%
 
2019 Sum 106 0.1%
 
Other values (3) 68 0.1%
 
Frequencies of value counts

Unique

Unique 1
Unique (%) < 0.1%
Distinct 9
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
Value Count Frequency (%)  
2020 Fall 37056 39.0%
 
2019 Fall 29807 31.3%
 
2018 Fall 27480 28.9%
 
2020 Spr 170 0.2%
 
2020 Sum 158 0.2%
 
2018 Sum 130 0.1%
 
2019 Sum 124 0.1%
 
2019 Spr 119 0.1%
 
2018 Spr 71 0.1%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

RESIDENCY_STATUS
Categorical

Distinct 5
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
Value Count Frequency (%)  
Non-Resident 65588 69.0%
 
Resident 27736 29.2%
 
Non-Resident International 1775 1.9%
 
Deferral - needs new TC review 15 < 0.1%
 
Referral 1 < 0.1%
 
Frequencies of value counts

Unique

Unique 1
Unique (%) < 0.1%
Distinct 1
Distinct (%) 0.1%
Missing 93963
Missing (%) 98.8%
Memory size 743.1 KiB
Value Count Frequency (%)  
1 1152 1.2%
 
(Missing) 93963 98.8%
 

PERSON_SEX
Categorical

Distinct 2
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
Value Count Frequency (%)  
F 49223 51.8%
 
M 45892 48.2%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

ETHNICITY
Categorical

MISSING

Distinct 6
Distinct (%) < 0.1%
Missing 1304
Missing (%) 1.4%
Memory size 743.1 KiB
Value Count Frequency (%)  
White 67019 70.5%
 
Hispanic 13018 13.7%
 
Asian 10351 10.9%
 
Black or African American 1707 1.8%
 
American Indian or Alaska Native 1173 1.2%
 
Native Hawaiian or Other Pacific Islander 543 0.6%
 
(Missing) 1304 1.4%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

PERSON_BIRTHDATE
Categorical

HIGH CARDINALITY

Distinct 1873
Distinct (%) 2.0%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
Value Count Frequency (%)  
2001-12-27 147 0.2%
 
2001-09-25 136 0.1%
 
2001-11-08 136 0.1%
 
2001-12-20 132 0.1%
 
2002-04-16 129 0.1%
 
2001-09-21 129 0.1%
 
2002-04-22 128 0.1%
 
2001-10-18 128 0.1%
 
2002-01-07 126 0.1%
 
2002-03-22 126 0.1%
 
Other values (1863) 93798 98.6%
 
Frequencies of value counts

Unique

Unique 321
Unique (%) 0.3%

ADDRESS_STREET_COMBINED
Categorical

HIGH CARDINALITY

Distinct 91588
Distinct (%) 96.3%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
Value Count Frequency (%)  
nan 390 0.4%
 
3-9D, Beijing Keji Huizhan Zhongxin No.48 Beisanhuan Xilu 8 < 0.1%
 
Office of Educational Affairs Royal Thai Embassy 1906 23rd Street, N.W. 7 < 0.1%
 
285 Caribou Ln 4 < 0.1%
 
5250 S Geneva Way 4 < 0.1%
 
PO Box 233 4 < 0.1%
 
9666 Pinebrook Way 4 < 0.1%
 
Office of Educational Affairs Royal Thai Embassy 1906 23rd Street, NW 4 < 0.1%
 
PO Box 302 4 < 0.1%
 
4876 W 117th Way 4 < 0.1%
 
Other values (91578) 94682 99.5%
 
Frequencies of value counts

Unique

Unique 88531
Unique (%) 93.1%

ADDRESS_CITY
Categorical

HIGH CARDINALITY

Distinct 6135
Distinct (%) 6.5%
Missing 390
Missing (%) 0.4%
Memory size 743.1 KiB
Value Count Frequency (%)  
Denver 3740 3.9%
 
Aurora 1845 1.9%
 
Boulder 1433 1.5%
 
Colorado Springs 1298 1.4%
 
Littleton 1229 1.3%
 
Highlands Ranch 1053 1.1%
 
Centennial 971 1.0%
 
Austin 949 1.0%
 
Fort Collins 938 1.0%
 
Longmont 902 0.9%
 
Other values (6125) 80367 84.5%
 
Frequencies of value counts

Unique

Unique 2402
Unique (%) 2.5%

ADDRESS_COUNTY
Categorical

HIGH CARDINALITY
MISSING

Distinct 1048
Distinct (%) 1.1%
Missing 2931
Missing (%) 3.1%
Memory size 743.1 KiB
Value Count Frequency (%)  
Arapahoe 4054 4.3%
 
Douglas 3829 4.0%
 
Los Angeles 3756 3.9%
 
Boulder 3672 3.9%
 
Jefferson 3457 3.6%
 
Denver 3367 3.5%
 
Orange 2847 3.0%
 
Cook 2449 2.6%
 
El Paso 2324 2.4%
 
Santa Clara 2203 2.3%
 
Other values (1038) 60226 63.3%
 
(Missing) 2931 3.1%
 
Frequencies of value counts

Unique

Unique 226
Unique (%) 0.2%

ADDRESS_REGION
Categorical

HIGH CARDINALITY

Distinct 683
Distinct (%) 0.7%
Missing 885
Missing (%) 0.9%
Memory size 743.1 KiB
Value Count Frequency (%)  
CO 27997 29.4%
 
CA 18649 19.6%
 
IL 5169 5.4%
 
TX 5112 5.4%
 
WA 2664 2.8%
 
NY 2590 2.7%
 
MA 2372 2.5%
 
FL 2242 2.4%
 
NJ 2138 2.2%
 
MD 1603 1.7%
 
Other values (673) 23694 24.9%
 
Frequencies of value counts

Unique

Unique 384
Unique (%) 0.4%

ADDRESS_POSTAL
Categorical

HIGH CARDINALITY

Distinct 85693
Distinct (%) 90.1%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
Value Count Frequency (%)  
nan 887 0.9%
 
81620 40 < 0.1%
 
81632 38 < 0.1%
 
80424 31 < 0.1%
 
00000 28 < 0.1%
 
92067 24 < 0.1%
 
20008 22 < 0.1%
 
81631 21 < 0.1%
 
81631- 19 < 0.1%
 
518000 19 < 0.1%
 
Other values (85683) 93986 98.8%
 
Frequencies of value counts

Unique

Unique 78845
Unique (%) 82.9%

ADDRESS_US_5_DIGIT_ZIP_CODE
Categorical

HIGH CARDINALITY

Distinct 8829
Distinct (%) 9.3%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
Value Count Frequency (%)  
nan 2823 3.0%
 
80016.0 749 0.8%
 
80126.0 725 0.8%
 
80027.0 659 0.7%
 
80111.0 601 0.6%
 
80134.0 571 0.6%
 
80503.0 491 0.5%
 
80108.0 453 0.5%
 
80304.0 429 0.5%
 
80130.0 428 0.4%
 
Other values (8819) 87186 91.7%
 
Frequencies of value counts

Unique

Unique 3102
Unique (%) 3.3%

ADDRESS_COUNTRY
Categorical

HIGH CARDINALITY

Distinct 119
Distinct (%) 0.1%
Missing 388
Missing (%) 0.4%
Memory size 743.1 KiB
Value Count Frequency (%)  
United States 92292 97.0%
 
China 409 0.4%
 
India 326 0.3%
 
Kuwait 140 0.1%
 
Saudi Arabia 137 0.1%
 
United Arab Emirates 96 0.1%
 
Brazil 91 0.1%
 
United Kingdom 81 0.1%
 
Mexico 70 0.1%
 
Thailand 46 < 0.1%
 
Other values (109) 1039 1.1%
 
(Missing) 388 0.4%
 
Frequencies of value counts

Unique

Unique 25
Unique (%) < 0.1%

PERSON_HOME_SCHOOL
Boolean

MISSING

Distinct 1
Distinct (%) 1.0%
Missing 95019
Missing (%) 99.9%
Memory size 743.1 KiB
Value Count Frequency (%)  
1 96 0.1%
 
(Missing) 95019 99.9%
 

PERSON_ACT_MAX_COMPOSITE
Real number (ℝ≥0)

MISSING

Distinct 29
Distinct (%) 0.1%
Missing 48103
Missing (%) 50.6%
Infinite 0
Infinite (%) 0.0%
Mean 28.48606739
Minimum 8
Maximum 45
Zeros 0
Zeros (%) 0.0%
Memory size 743.1 KiB

Quantile statistics

Minimum 8
5-th percentile 21
Q1 26
median 29
Q3 32
95-th percentile 35
Maximum 45
Range 37
Interquartile range (IQR) 6

Descriptive statistics

Standard deviation 4.12395389
Coefficient of variation (CV) 0.1447709097
Kurtosis -0.322426669
Mean 28.48606739
Median Absolute Deviation (MAD) 3
Skewness -0.4209749781
Sum 1339187
Variance 17.00699568
Monotonicity Not monotonic
Histogram with fixed size bins (bins=29)
Value Count Frequency (%)  
30 4286 4.5%
 
31 4034 4.2%
 
29 3869 4.1%
 
28 3815 4.0%
 
32 3788 4.0%
 
27 3673 3.9%
 
33 3496 3.7%
 
26 3381 3.6%
 
34 2931 3.1%
 
25 2901 3.0%
 
Other values (19) 10838 11.4%
 
(Missing) 48103 50.6%
 
Value Count Frequency (%)  
8 1 < 0.1%
 
10 1 < 0.1%
 
11 1 < 0.1%
 
12 2 < 0.1%
 
13 4 < 0.1%
 
Value Count Frequency (%)  
45 1 < 0.1%
 
36 433 0.5%
 
35 1973 2.1%
 
34 2931 3.1%
 
33 3496 3.7%
 

PERSON_SATI_MAX_TOTAL
Real number (ℝ≥0)

MISSING

Distinct 67
Distinct (%) 48.6%
Missing 94977
Missing (%) 99.9%
Infinite 0
Infinite (%) 0.0%
Mean 1157.818841
Minimum 323
Maximum 1600
Zeros 0
Zeros (%) 0.0%
Memory size 743.1 KiB

Quantile statistics

Minimum 323
5-th percentile 808.5
Q1 1045
median 1170
Q3 1287.5
95-th percentile 1480
Maximum 1600
Range 1277
Interquartile range (IQR) 242.5

Descriptive statistics

Standard deviation 207.9185423
Coefficient of variation (CV) 0.1795777845
Kurtosis 1.370667655
Mean 1157.818841
Median Absolute Deviation (MAD) 120
Skewness -0.5743336599
Sum 159779
Variance 43230.12023
Monotonicity Not monotonic
Histogram with fixed size bins (bins=50)
Value Count Frequency (%)  
1070 6 < 0.1%
 
1240 5 < 0.1%
 
1100 5 < 0.1%
 
1210 5 < 0.1%
 
1230 5 < 0.1%
 
1160 4 < 0.1%
 
1220 4 < 0.1%
 
1170 4 < 0.1%
 
1250 4 < 0.1%
 
1290 4 < 0.1%
 
Other values (57) 92 0.1%
 
(Missing) 94977 99.9%
 
Value Count Frequency (%)  
323 1 < 0.1%
 
578 1 < 0.1%
 
640 1 < 0.1%
 
771 1 < 0.1%
 
780 1 < 0.1%
 
Value Count Frequency (%)  
1600 2 < 0.1%
 
1550 1 < 0.1%
 
1530 1 < 0.1%
 
1520 1 < 0.1%
 
1500 1 < 0.1%
 

PERSON_SATR_MAX_COMPS
Real number (ℝ≥0)

MISSING

Distinct 101
Distinct (%) 0.2%
Missing 33220
Missing (%) 34.9%
Infinite 0
Infinite (%) 0.0%
Mean 1272.069182
Minimum 37
Maximum 1600
Zeros 0
Zeros (%) 0.0%
Memory size 743.1 KiB

Quantile statistics

Minimum 37
5-th percentile 1030
Q1 1170
median 1270
Q3 1380
95-th percentile 1510
Maximum 1600
Range 1563
Interquartile range (IQR) 210

Descriptive statistics

Standard deviation 145.0399757
Coefficient of variation (CV) 0.1140189369
Kurtosis -0.1440771506
Mean 1272.069182
Median Absolute Deviation (MAD) 100
Skewness -0.2222687104
Sum 78734722
Variance 21036.59456
Monotonicity Not monotonic
Histogram with fixed size bins (bins=50)
Value Count Frequency (%)  
1310 1664 1.7%
 
1230 1643 1.7%
 
1300 1624 1.7%
 
1260 1618 1.7%
 
1240 1614 1.7%
 
1220 1572 1.7%
 
1250 1566 1.6%
 
1320 1559 1.6%
 
1270 1556 1.6%
 
1330 1547 1.6%
 
Other values (91) 45932 48.3%
 
(Missing) 33220 34.9%
 
Value Count Frequency (%)  
37 1 < 0.1%
 
610 1 < 0.1%
 
611 1 < 0.1%
 
620 1 < 0.1%
 
640 1 < 0.1%
 
Value Count Frequency (%)  
1600 20 < 0.1%
 
1590 71 0.1%
 
1580 130 0.1%
 
1570 216 0.2%
 
1560 271 0.3%
 
Distinct 2
Distinct (%) 0.1%
Missing 93438
Missing (%) 98.2%
Memory size 743.1 KiB
Value Count Frequency (%)  
1 956 1.0%
 
0 721 0.8%
 
(Missing) 93438 98.2%
 
Distinct 2
Distinct (%) < 0.1%
Missing 82644
Missing (%) 86.9%
Memory size 743.1 KiB
Value Count Frequency (%)  
Yes 12470 13.1%
 
No 1 < 0.1%
 
(Missing) 82644 86.9%
 

PERSON_FAMILY_SIZE
Categorical

Distinct 2
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
Value Count Frequency (%)  
nan 94876 99.7%
 
11+ 239 0.3%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%
Distinct 12
Distinct (%) < 0.1%
Missing 227
Missing (%) 0.2%
Memory size 743.1 KiB
Value Count Frequency (%)  
G. Bachelor's Level Degree 41615 43.8%
 
H. Some Graduate School 32008 33.7%
 
C. High School Graduate or Equivalent 7658 8.1%
 
D. Some College 6002 6.3%
 
E. Technical School 4056 4.3%
 
B. Less Than High School Graduate 3201 3.4%
 
A. Not Indicated 268 0.3%
 
I. Master's Level Degree 50 0.1%
 
K. Doctorate (Professional) 17 < 0.1%
 
F. 2-Year College Degree 7 < 0.1%
 
Other values (2) 6 < 0.1%
 
(Missing) 227 0.2%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

PARENT_CU_ATTENDANCE
Boolean

MISSING

Distinct 2
Distinct (%) 0.5%
Missing 94732
Missing (%) 99.6%
Memory size 743.1 KiB
Value Count Frequency (%)  
0 323 0.3%
 
1 60 0.1%
 
(Missing) 94732 99.6%
 

PARENT_CU_EMPLOYMENT
Boolean

MISSING

Distinct 2
Distinct (%) 0.4%
Missing 94602
Missing (%) 99.5%
Memory size 743.1 KiB
Value Count Frequency (%)  
0 361 0.4%
 
1 152 0.2%
 
(Missing) 94602 99.5%
 
Distinct 2
Distinct (%) 0.1%
Missing 93605
Missing (%) 98.4%
Memory size 743.1 KiB
Value Count Frequency (%)  
1 1433 1.5%
 
0 77 0.1%
 
(Missing) 93605 98.4%
 

PERSON_NATIVE_LANGUAGE
Categorical

HIGH CARDINALITY
MISSING

Distinct 53
Distinct (%) 0.1%
Missing 1896
Missing (%) 2.0%
Memory size 743.1 KiB
Value Count Frequency (%)  
EN 88055 92.6%
 
SP 2557 2.7%
 
CM 429 0.5%
 
AR 314 0.3%
 
FR 284 0.3%
 
RU 162 0.2%
 
VI 130 0.1%
 
GE 130 0.1%
 
PO 129 0.1%
 
KO 104 0.1%
 
Other values (43) 925 1.0%
 
(Missing) 1896 2.0%
 
Frequencies of value counts

Unique

Unique 5
Unique (%) < 0.1%

PERSON_GROSS_FAMILY_INCOME
Categorical

MISSING

Distinct 7
Distinct (%) < 0.1%
Missing 31492
Missing (%) 33.1%
Memory size 743.1 KiB
Value Count Frequency (%)  
G. More than $150,000 35305 37.1%
 
F. $100,000 - $149,999 11058 11.6%
 
E. $75,000 - $99,999 4963 5.2%
 
C. $35,000 - $59,999 4358 4.6%
 
B. $15,000 - $34,999 3204 3.4%
 
D. $60,000 - $74,999 3055 3.2%
 
A. $0 - $15,000 1680 1.8%
 
(Missing) 31492 33.1%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

PERSON_FAFSA_SUBMITTED
Boolean

MISSING

Distinct 1
Distinct (%) < 0.1%
Missing 33655
Missing (%) 35.4%
Memory size 743.1 KiB
Value Count Frequency (%)  
1 61460 64.6%
 
(Missing) 33655 35.4%
 

PERSON_MILITARY_STATUS
Categorical

MISSING

Distinct 4
Distinct (%) 3.1%
Missing 94984
Missing (%) 99.9%
Memory size 743.1 KiB
Value Count Frequency (%)  
Previously Served 69 0.1%
 
Current Dependent 44 < 0.1%
 
On Active Duty U.S. Military 12 < 0.1%
 
No relationship 6 < 0.1%
 
(Missing) 94984 99.9%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

PERSON_HS_RANK_PERCENTILE
Real number (ℝ)

MISSING

Distinct 110
Distinct (%) 0.3%
Missing 60005
Missing (%) 63.1%
Infinite 0
Infinite (%) 0.0%
Mean 77.2958986
Minimum -1140
Maximum 100
Zeros 8
Zeros (%) < 0.1%
Memory size 743.1 KiB

Quantile statistics

Minimum -1140
5-th percentile 41
Q1 66
median 82
Q3 93
95-th percentile 99
Maximum 100
Range 1240
Interquartile range (IQR) 27

Descriptive statistics

Standard deviation 20.9363599
Coefficient of variation (CV) 0.270859907
Kurtosis 498.9718105
Mean 77.2958986
Median Absolute Deviation (MAD) 12
Skewness -10.57533708
Sum 2713859
Variance 438.3311661
Monotonicity Not monotonic
Histogram with fixed size bins (bins=50)
Value Count Frequency (%)  
100 1371 1.4%
 
99 1280 1.3%
 
98 1176 1.2%
 
96 1084 1.1%
 
97 1072 1.1%
 
95 1042 1.1%
 
93 982 1.0%
 
94 943 1.0%
 
90 923 1.0%
 
92 911 1.0%
 
Other values (100) 24326 25.6%
 
(Missing) 60005 63.1%
 
Value Count Frequency (%)  
-1140 1 < 0.1%
 
-840 1 < 0.1%
 
-737 1 < 0.1%
 
-283 1 < 0.1%
 
-225 1 < 0.1%
 
Value Count Frequency (%)  
100 1371 1.4%
 
99 1280 1.3%
 
98 1176 1.2%
 
97 1072 1.1%
 
96 1084 1.1%
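
Several of the extreme values reported above fall outside the range one would normally expect for the variable, for example a class-rank percentile of -1140 or an ACT composite of 45. The sketch below compares the reported minimum and maximum of three variables against plausible bounds. The bounds are assumptions (ACT composite 1 to 36, SAT total 400 to 1600, percentile 0 to 100) and are not stated in the report; the function name out_of_range is hypothetical.

```python
# Assumed valid ranges; these are not part of the report.
VALID_RANGES = {
    "PERSON_ACT_MAX_COMPOSITE": (1, 36),
    "PERSON_SATR_MAX_COMPS": (400, 1600),
    "PERSON_HS_RANK_PERCENTILE": (0, 100),
}

# (minimum, maximum) pairs copied from the variable summaries above.
REPORTED_EXTREMES = {
    "PERSON_ACT_MAX_COMPOSITE": (8, 45),
    "PERSON_SATR_MAX_COMPS": (37, 1600),
    "PERSON_HS_RANK_PERCENTILE": (-1140, 100),
}


def out_of_range(extremes, valid_ranges):
    """Return, per column, a list of messages for extremes outside the bounds."""
    flagged = {}
    for column, (observed_min, observed_max) in extremes.items():
        low, high = valid_ranges[column]
        problems = []
        if observed_min < low:
            problems.append(f"minimum {observed_min} is below {low}")
        if observed_max > high:
            problems.append(f"maximum {observed_max} is above {high}")
        if problems:
            flagged[column] = problems
    return flagged


flagged = out_of_range(REPORTED_EXTREMES, VALID_RANGES)
for column, problems in flagged.items():
    print(column, "->", "; ".join(problems))
```

Whether such values are data-entry errors or legitimate codes cannot be determined from the report alone, so the check only flags them for review.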
 

PERSON_HS_EQUIVALENCY
Boolean

MISSING

Distinct 1
Distinct (%) 0.9%
Missing 95008
Missing (%) 99.9%
Memory size 743.1 KiB
Value Count Frequency (%)  
1 107 0.1%
 
(Missing) 95008 99.9%
 

PERSON_CUMULATIVE_GPA
Real number (ℝ≥0)

MISSING

Distinct 1126
Distinct (%) 12.7%
Missing 86235
Missing (%) 90.7%
Infinite 0
Infinite (%) 0.0%
Mean 3.526430968
Minimum 0
Maximum 31.944
Zeros 10
Zeros (%) < 0.1%
Memory size 743.1 KiB

Quantile statistics

Minimum 0
5-th percentile 2.33095
Q1 3.19
median 3.7
Q3 4
95-th percentile 4
Maximum 31.944
Range 31.944
Interquartile range (IQR) 0.81

Descriptive statistics

Standard deviation 0.6613074542
Coefficient of variation (CV) 0.1875288245
Kurtosis 386.6703661
Mean 3.526430968
Median Absolute Deviation (MAD) 0.3
Skewness 7.958966707
Sum 31314.707
Variance 0.437327549
Monotonicity Not monotonic
Histogram with fixed size bins (bins=50)
Value Count Frequency (%)  
4 3229 3.4%
 
3 793 0.8%
 
3.7 369 0.4%
 
3.5 243 0.3%
 
3.3 198 0.2%
 
2 178 0.2%
 
2.7 120 0.1%
 
3.85 111 0.1%
 
3.667 96 0.1%
 
3.75 62 0.1%
 
Other values (1116) 3481 3.7%
 
(Missing) 86235 90.7%
 
Value Count Frequency (%)  
0 10 < 0.1%
 
0.25 2 < 0.1%
 
0.27 1 < 0.1%
 
0.286 1 < 0.1%
 
0.333 3 < 0.1%
 
Value Count Frequency (%)  
31.944 1 < 0.1%
 
10.435 1 < 0.1%
 
6 2 < 0.1%
 
5.365 1 < 0.1%
 
5.091 1 < 0.1%
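
The extreme values above suggest data-entry errors or mixed grading scales, since the maximum (31.944) is far outside the bulk of the distribution. As a rough sketch, assuming Tukey's 1.5 x IQR rule is an acceptable screen (the report itself does not apply it), the quartiles listed for PERSON_CUMULATIVE_GPA give the following fences. The lists hold only the five smallest and five largest values shown in the report.

```python
# Quartiles copied from the report's quantile statistics.
q1, q3 = 3.19, 4.0
iqr = q3 - q1

# Tukey fences: an assumed screening rule, not something the report computes.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The five largest and five smallest values listed in the report.
largest = [31.944, 10.435, 6.0, 5.365, 5.091]
smallest = [0.0, 0.25, 0.27, 0.286, 0.333]

high_outliers = [v for v in largest if v > upper_fence]
low_outliers = [v for v in smallest if v < lower_fence]

print(f"fences: [{lower_fence:.3f}, {upper_fence:.3f}]")
print("high:", high_outliers)
print("low:", low_outliers)
```

Values flagged this way are candidates for review rather than automatic removal; a weighted GPA above 4 may be legitimate.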
 

PERSON_HS_GPA_CONVERT
Real number (ℝ≥0)

Distinct 1467
Distinct (%) 1.6%
Missing 633
Missing (%) 0.7%
Infinite 0
Infinite (%) 0.0%
Mean 3.721951303
Minimum 0.109
Maximum 4
Zeros 0
Zeros (%) 0.0%
Memory size 743.1 KiB

Quantile statistics

Minimum 0.109
5-th percentile 3.108
Q1 3.519
median 3.829
Q3 4
95-th percentile 4
Maximum 4
Range 3.891
Interquartile range (IQR) 0.481

Descriptive statistics

Standard deviation 0.3142671057
Coefficient of variation (CV) 0.08443611433
Kurtosis 0.7708223526
Mean 3.721951303
Median Absolute Deviation (MAD) 0.171
Skewness -1.086175734
Sum 351657.403
Variance 0.09876381375
Monotonicity Not monotonic
Histogram with fixed size bins (bins=50)
Value Count Frequency (%)  
4 33228 34.9%
 
3.5 613 0.6%
 
3.75 463 0.5%
 
3.9 443 0.5%
 
3.8 434 0.5%
 
3.7 420 0.4%
 
3.6 406 0.4%
 
3.95 364 0.4%
 
3.98 359 0.4%
 
3.76 334 0.4%
 
Other values (1457) 57418 60.4%
 
(Missing) 633 0.7%
 
Value Count Frequency (%)  
0.109 1 < 0.1%
 
0.54 1 < 0.1%
 
1.89 1 < 0.1%
 
1.97 1 < 0.1%
 
2 1 < 0.1%
 
Value Count Frequency (%)  
4 33228 34.9%
 
3.999 12 < 0.1%
 
3.998 13 < 0.1%
 
3.997 22 < 0.1%
 
3.996 33 < 0.1%
 

PERSON_HS_GPA_SR
Categorical

HIGH CARDINALITY
MISSING

Distinct 8559
Distinct (%) 13.0%
Missing 29472
Missing (%) 31.0%
Memory size 743.1 KiB
4.00
 
787
3.70
 
600
3.75
 
580
3.95
 
572
3.30
 
491
Other values (8554)
62613 
Value Count Frequency (%)  
4.00 787 0.8%
 
3.70 600 0.6%
 
3.75 580 0.6%
 
3.95 572 0.6%
 
3.30 491 0.5%
 
3.67 446 0.5%
 
3.97 443 0.5%
 
3.85 441 0.5%
 
3.98 432 0.5%
 
3.83 424 0.4%
 
Other values (8549) 60427 63.5%
 
(Missing) 29472 31.0%
 
Frequencies of value counts

Unique

Unique 4822
Unique (%) 7.3%

1_HIGH_SCHOOL_CEEB_CODE
Categorical

HIGH CARDINALITY

Distinct 9661
Distinct (%) 10.2%
Missing 22
Missing (%) < 0.1%
Memory size 743.1 KiB
060515
 
1010
060118
 
896
060400
 
708
060115
 
634
060748
 
592
Other values (9656)
91253 
Value Count Frequency (%)  
060515 1010 1.1%
 
060118 896 0.9%
 
060400 708 0.7%
 
060115 634 0.7%
 
060748 592 0.6%
 
060130 557 0.6%
 
060747 533 0.6%
 
060928 498 0.5%
 
060163 408 0.4%
 
060086 391 0.4%
 
Other values (9651) 88866 93.4%
 
Frequencies of value counts

Unique

Unique 3849
Unique (%) 4.0%

1_HIGH_SCHOOL_NAME
Categorical

HIGH CARDINALITY

Distinct 13440
Distinct (%) 14.1%
Missing 19
Missing (%) < 0.1%
Memory size 743.1 KiB
Cherry Creek High School
 
771
Fairview High School
 
733
East High School
 
539
Boulder High School
 
508
Rock Canyon High School
 
490
Other values (13435)
92055 
Value Count Frequency (%)  
Cherry Creek High School 771 0.8%
 
Fairview High School 733 0.8%
 
East High School 539 0.6%
 
Boulder High School 508 0.5%
 
Rock Canyon High School 490 0.5%
 
Mountain Vista High School 440 0.5%
 
Monarch High School 424 0.4%
 
Arapahoe High School 397 0.4%
 
Grandview High School 322 0.3%
 
Legacy High School 317 0.3%
 
Other values (13430) 90155 94.8%
 
Frequencies of value counts

Unique

Unique 5864
Unique (%) 6.2%
Distinct 88
Distinct (%) 0.1%
Missing 19
Missing (%) < 0.1%
Memory size 743.1 KiB
Minimum 1993-12-01 00:00:00
Maximum 2024-08-01 00:00:00
Histogram with fixed size bins (bins=50)

1_HIGH_SCHOOL_REGION
Categorical

HIGH CARDINALITY
MISSING

Distinct 263
Distinct (%) 0.3%
Missing 2090
Missing (%) 2.2%
Memory size 743.1 KiB
CO
27800 
CA
18678 
IL
5149 
TX
5048 
WA
 
2644
Other values (258)
33706 
Value Count Frequency (%)  
CO 27800 29.2%
 
CA 18678 19.6%
 
IL 5149 5.4%
 
TX 5048 5.3%
 
WA 2644 2.8%
 
NY 2540 2.7%
 
MA 2437 2.6%
 
FL 2202 2.3%
 
NJ 2139 2.2%
 
PA 1614 1.7%
 
Other values (253) 22774 23.9%
 
(Missing) 2090 2.2%
 
Frequencies of value counts

Unique

Unique 111
Unique (%) 0.1%

1_HIGH_SCHOOL_CITY
Categorical

HIGH CARDINALITY

Distinct 6651
Distinct (%) 7.0%
Missing 216
Missing (%) 0.2%
Memory size 743.1 KiB
Denver
 
2848
Aurora
 
1947
Boulder
 
1254
Littleton
 
1171
Highlands Ranch
 
1118
Other values (6646)
86561 
Value Count Frequency (%)  
Denver 2848 3.0%
 
Aurora 1947 2.0%
 
Boulder 1254 1.3%
 
Littleton 1171 1.2%
 
Highlands Ranch 1118 1.2%
 
Colorado Springs 1118 1.2%
 
Englewood 939 1.0%
 
DENVER 911 1.0%
 
Broomfield 826 0.9%
 
Austin 735 0.8%
 
Other values (6641) 82032 86.2%
 
Frequencies of value counts

Unique

Unique 2345
Unique (%) 2.5%
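
The frequency table for 1_HIGH_SCHOOL_CITY lists "Denver" and "DENVER" as separate categories, so part of the high cardinality comes from inconsistent casing. A minimal sketch of merging such variants, assuming case and surrounding whitespace are the only differences, is shown below. The helper name `normalize_city` is hypothetical and the counts are the ten most frequent values from the report.

```python
from collections import Counter

# Top-10 city counts copied from the report's frequency table.
reported_counts = {
    "Denver": 2848,
    "Aurora": 1947,
    "Boulder": 1254,
    "Littleton": 1171,
    "Highlands Ranch": 1118,
    "Colorado Springs": 1118,
    "Englewood": 939,
    "DENVER": 911,
    "Broomfield": 826,
    "Austin": 735,
}


def normalize_city(name: str) -> str:
    """Hypothetical helper: collapse whitespace and ignore case."""
    return " ".join(name.split()).casefold()


# Re-aggregate the counts under the normalized key.
merged_counts = Counter()
for city, count in reported_counts.items():
    merged_counts[normalize_city(city)] += count

print(merged_counts.most_common(3))
```

This does not distinguish same-named cities in different states; combining the key with 1_HIGH_SCHOOL_REGION would be one way to handle that.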

1_HIGH_SCHOOL_TYPE
Categorical

Distinct 1
Distinct (%) < 0.1%
Missing 19
Missing (%) < 0.1%
Memory size 743.1 KiB
High School
95096 
Value Count Frequency (%)  
High School 95096 > 99.9%
 
(Missing) 19 < 0.1%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

1_HIGH_SCHOOL_HONORS
Unsupported

MISSING
REJECTED
UNSUPPORTED

Missing 95115
Missing (%) 100.0%
Memory size 743.2 KiB

CUB_IS__1_CHOICE
Categorical

MISSING

Distinct 2
Distinct (%) < 0.1%
Missing 67074
Missing (%) 70.5%
Memory size 743.1 KiB
Other college listed
27777 
Yes
 
264
Value Count Frequency (%)  
Other college listed 27777 29.2%
 
Yes 264 0.3%
 
(Missing) 67074 70.5%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%
Distinct 2
Distinct (%) < 0.1%
Missing 20344
Missing (%) 21.4%
Memory size 743.1 KiB
No
52873 
Yes
21898 
(Missing)
20344 
Value Count Frequency (%)  
No 52873 55.6%
 
Yes 21898 23.0%
 
(Missing) 20344 21.4%
 

ADMITTED_COLLEGE
Categorical

Distinct 8
Distinct (%) < 0.1%
Missing 1
Missing (%) < 0.1%
Memory size 743.1 KiB
ARSCU
48477 
MULTU
16251 
ENGRU
14051 
BUSNU
8444 
CMCIU
 
4833
Other values (3)
 
3058
Value Count Frequency (%)  
ARSCU 48477 51.0%
 
MULTU 16251 17.1%
 
ENGRU 14051 14.8%
 
BUSNU 8444 8.9%
 
CMCIU 4833 5.1%
 
EDUCU 1280 1.3%
 
ARPLU 1252 1.3%
 
MUSCU 526 0.6%
 
(Missing) 1 < 0.1%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%
Distinct 8
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
College of Arts & Sciences
41792 
College of Engineering & Applied Science
23250 
Leeds School of Business
19052 
College of Media, Communication & Information
5648 
School of Education
 
1525
Other values (3)
 
3848
Value Count Frequency (%)  
College of Arts & Sciences 41792 43.9%
 
College of Engineering & Applied Science 23250 24.4%
 
Leeds School of Business 19052 20.0%
 
College of Media, Communication & Information 5648 5.9%
 
School of Education 1525 1.6%
 
Program in Exploratory Studies 1401 1.5%
 
Program in Environmental Design 1370 1.4%
 
College of Music 1077 1.1%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

APPLICATION_ORIGINAL_ACADEMIC_INTEREST
Categorical

HIGH CARDINALITY

Distinct 84
Distinct (%) 0.1%
Missing 1
Missing (%) < 0.1%
Memory size 743.1 KiB
Business - Open Option (Undecided)
10606 
Psychology
 
6188
Arts and Sciences - Open Option (Undecided)
 
5400
Aerospace Engineering Sciences
 
4614
Biological Sciences-Molecular, Cellular, & Developmental Biology
 
3873
Other values (79)
64433 
Value Count Frequency (%)  
Business - Open Option (Undecided) 10606 11.2%
 
Psychology 6188 6.5%
 
Arts and Sciences - Open Option (Undecided) 5400 5.7%
 
Aerospace Engineering Sciences 4614 4.9%
 
Biological Sciences-Molecular, Cellular, & Developmental Biology 3873 4.1%
 
Mechanical Engineering 3863 4.1%
 
Computer Science (Engineering, BS) 3543 3.7%
 
Biological Sciences-Integrative Physiology 3494 3.7%
 
Marketing 2801 2.9%
 
Engineering - Open Option (Undecided) 2700 2.8%
 
Other values (74) 48032 50.5%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

APPLICATION_SUBMITTED_DATE
Categorical

HIGH CARDINALITY

Distinct 859
Distinct (%) 0.9%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
2019-10-15 12:00 AM
 
5012
2019-11-15 12:00 AM
 
4177
2018-10-30 12:00 AM
 
3919
2018-11-15 12:00 AM
 
3542
2017-11-15 12:00 AM
 
3485
Other values (854)
74980 
Value Count Frequency (%)  
2019-10-15 12:00 AM 5012 5.3%
 
2019-11-15 12:00 AM 4177 4.4%
 
2018-10-30 12:00 AM 3919 4.1%
 
2018-11-15 12:00 AM 3542 3.7%
 
2017-11-15 12:00 AM 3485 3.7%
 
2019-11-14 12:00 AM 2620 2.8%
 
2020-01-15 12:00 AM 2513 2.6%
 
2018-01-15 12:00 AM 2362 2.5%
 
2017-11-14 12:00 AM 2282 2.4%
 
2018-11-14 12:00 AM 2114 2.2%
 
Other values (849) 63089 66.3%
 
Frequencies of value counts

Unique

Unique 171
Unique (%) 0.2%
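
APPLICATION_SUBMITTED_DATE is typed as Categorical and flagged for high cardinality because its values are stored as strings such as "2019-10-15 12:00 AM". The sketch below shows one way to parse that layout with the standard library, assuming every value follows the `YYYY-MM-DD hh:mm AM/PM` pattern seen in the table. The function name `parse_report_timestamp` is hypothetical, and `%p` depends on the active locale recognising "AM"/"PM".

```python
from datetime import datetime

# Assumed layout, inferred from the values shown in the frequency table.
TIMESTAMP_FORMAT = "%Y-%m-%d %I:%M %p"


def parse_report_timestamp(text: str) -> datetime:
    """Hypothetical helper: parse one timestamp string from the report."""
    return datetime.strptime(text.strip(), TIMESTAMP_FORMAT)


submitted = parse_report_timestamp("2019-10-15 12:00 AM")
released = parse_report_timestamp("2020-01-10 06:00 PM")

# With real datetimes, intervals between events become simple arithmetic.
days_to_decision = (released.date() - submitted.date()).days

print(submitted.isoformat(), released.isoformat(), days_to_decision)
```

Once parsed, the column can be profiled as a date, with a minimum, a maximum, and a histogram, instead of as hundreds of distinct string categories.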
Distinct 15
Distinct (%) < 0.1%
Missing 4
Missing (%) < 0.1%
Memory size 743.1 KiB
Admit First Choice Program
70286 
Admit Exploratory Studies
10982 
Admit Arts and Science AMO
 
5062
Admit Pre Business
 
4182
Admit Pre Engineering
 
2408
Other values (10)
 
2191
Value Count Frequency (%)  
Admit First Choice Program 70286 73.9%
 
Admit Exploratory Studies 10982 11.5%
 
Admit Arts and Science AMO 5062 5.3%
 
Admit Pre Business 4182 4.4%
 
Admit Pre Engineering 2408 2.5%
 
Admit McNeill 582 0.6%
 
Admit Exploratory Studies with Excel Invite 500 0.5%
 
Admit Exploratory Studies with GoldShirt Invite 346 0.4%
 
Admit McNeill to Exploratory Studies 286 0.3%
 
Admit Arts and Sciences AMO with Excel Invite 181 0.2%
 
Other values (5) 296 0.3%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

ADMIT_DECISION_RELEASED_DATE
Categorical

HIGH CARDINALITY

Distinct 557
Distinct (%) 0.6%
Missing 1
Missing (%) < 0.1%
Memory size 743.1 KiB
2020-01-10 06:00 PM
21354 
2019-01-11 06:53 PM
9987 
2019-01-11 06:48 PM
9908 
2018-01-16 06:20 PM
6582 
2019-03-01 07:15 PM
5826 
Other values (552)
41457 
Value Count Frequency (%)  
2020-01-10 06:00 PM 21354 22.5%
 
2019-01-11 06:53 PM 9987 10.5%
 
2019-01-11 06:48 PM 9908 10.4%
 
2018-01-16 06:20 PM 6582 6.9%
 
2019-03-01 07:15 PM 5826 6.1%
 
2018-03-14 04:30 PM 5551 5.8%
 
2018-01-17 06:30 PM 4151 4.4%
 
2020-03-06 06:38 PM 3617 3.8%
 
2020-03-06 06:33 PM 2801 2.9%
 
2020-03-20 06:52 PM 2537 2.7%
 
Other values (547) 22800 24.0%
 
Frequencies of value counts

Unique

Unique 309
Unique (%) 0.3%

ADMIT_DECISION_RECEIVED_DATE
Categorical

HIGH CARDINALITY
MISSING

Distinct 28071
Distinct (%) 35.0%
Missing 14935
Missing (%) 15.7%
Memory size 743.1 KiB
2020-01-10 06:50 PM
 
452
2020-01-10 06:51 PM
 
415
2020-01-10 06:49 PM
 
354
2020-01-10 06:44 PM
 
341
2020-01-10 06:48 PM
 
336
Other values (28066)
78282 
Value Count Frequency (%)  
2020-01-10 06:50 PM 452 0.5%
 
2020-01-10 06:51 PM 415 0.4%
 
2020-01-10 06:49 PM 354 0.4%
 
2020-01-10 06:44 PM 341 0.4%
 
2020-01-10 06:48 PM 336 0.4%
 
2020-01-10 06:47 PM 327 0.3%
 
2020-01-10 06:45 PM 319 0.3%
 
2020-01-10 06:46 PM 316 0.3%
 
2018-01-16 06:36 PM 304 0.3%
 
2020-01-10 06:52 PM 292 0.3%
 
Other values (28061) 76724 80.7%
 
(Missing) 14935 15.7%
 
Frequencies of value counts

Unique

Unique 20363
Unique (%) 25.4%

DEPOSIT_DECISION_CONFIRMED_DATE
Categorical

HIGH CARDINALITY
MISSING

Distinct 12437
Distinct (%) 52.7%
Missing 71523
Missing (%) 75.2%
Memory size 743.1 KiB
2020-04-20 09:35 AM
 
194
2020-04-30 06:12 PM
 
144
2020-04-08 11:24 AM
 
127
2020-05-01 05:39 PM
 
118
2020-04-29 06:45 PM
 
87
Other values (12432)
22922 
Value Count Frequency (%)  
2020-04-20 09:35 AM 194 0.2%
 
2020-04-30 06:12 PM 144 0.2%
 
2020-04-08 11:24 AM 127 0.1%
 
2020-05-01 05:39 PM 118 0.1%
 
2020-04-29 06:45 PM 87 0.1%
 
2020-04-02 02:27 PM 84 0.1%
 
2020-04-27 06:55 PM 82 0.1%
 
2020-04-28 06:58 PM 82 0.1%
 
2020-05-04 07:40 PM 73 0.1%
 
2020-04-13 06:50 PM 73 0.1%
 
Other values (12427) 22528 23.7%
 
(Missing) 71523 75.2%
 
Frequencies of value counts

Unique

Unique 7427
Unique (%) 31.5%
Distinct 1
Distinct (%) < 0.1%
Missing 74462
Missing (%) 78.3%
Memory size 743.1 KiB
1
20653 
(Missing)
74462 
Value Count Frequency (%)  
1 20653 21.7%
 
(Missing) 74462 78.3%
 
Distinct 8
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
Admit
50536 
Withdraw
22329 
Deposit Paid
19850 
Defer (Deposit Paid)
 
1404
Defer Admit
 
498
Other values (3)
 
498
Value Count Frequency (%)  
Admit 50536 53.1%
 
Withdraw 22329 23.5%
 
Deposit Paid 19850 20.9%
 
Defer (Deposit Paid) 1404 1.5%
 
Defer Admit 498 0.5%
 
Deposit Pending 465 0.5%
 
Defer (Deposit Pending) 30 < 0.1%
 
Administrative Withdraw 3 < 0.1%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

PAST_APP_HISTORY
Categorical

HIGH CARDINALITY
MISSING

Distinct 328
Distinct (%) 27.9%
Missing 93941
Missing (%) 98.8%
Memory size 743.1 KiB
2019 Fall FR (2019-01-11 Admit: Admit First Choice Program)
149 
2018 Fall FR (2018-01-16 Admit: Admit First Choice Program)
106 
2019 Fall FR (2019-03-01 Admit: Admit First Choice Program)
 
69
2019 Fall FR (2019-01-11 Admit: Admit Exploratory Studies)
 
61
2018 Fall FR (2018-03-14 Admit: Admit First Choice Program)
 
55
Other values (323)
734 
Value Count Frequency (%)  
2019 Fall FR (2019-01-11 Admit: Admit First Choice Program) 149 0.2%
 
2018 Fall FR (2018-01-16 Admit: Admit First Choice Program) 106 0.1%
 
2019 Fall FR (2019-03-01 Admit: Admit First Choice Program) 69 0.1%
 
2019 Fall FR (2019-01-11 Admit: Admit Exploratory Studies) 61 0.1%
 
2018 Fall FR (2018-03-14 Admit: Admit First Choice Program) 55 0.1%
 
2018 Fall FR (2018-01-17 Admit: Admit First Choice Program) 35 < 0.1%
 
2018 Fall FR (2018-02-14 Admit: Admit First Choice Program) 30 < 0.1%
 
2019 Fall FR (2019-03-01 Admit: Admit Exploratory Studies) 24 < 0.1%
 
2018 Fall FR (2018-03-02 Admit: Admit First Choice Program) 20 < 0.1%
 
2018 Fall FR (2017-12-20 Admit: Admit First Choice Program) 15 < 0.1%
 
Other values (318) 610 0.6%
 
(Missing) 93941 98.8%
 
Frequencies of value counts

Unique

Unique 211
Unique (%) 18.0%

PERSON_ENGAGEMENT
Categorical

MISSING

Distinct 6
Distinct (%) < 0.1%
Missing 53021
Missing (%) 55.7%
Memory size 743.1 KiB
Red
11780 
Black
10083 
Blue
9739 
Purple
5450 
Green
4571 
Value Count Frequency (%)  
Red 11780 12.4%
 
Black 10083 10.6%
 
Blue 9739 10.2%
 
Purple 5450 5.7%
 
Green 4571 4.8%
 
Yellow 471 0.5%
 
(Missing) 53021 55.7%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

TOTAL_PING_TIME__SEC
Real number (ℝ≥0)

ZEROS

Distinct 1370
Distinct (%) 1.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 90.89132103
Minimum 0
Maximum 15012
Zeros 9336
Zeros (%) 9.8%
Memory size 743.1 KiB

Quantile statistics

Minimum 0
5-th percentile 0
Q1 8
median 30
Q3 88
95-th percentile 416
Maximum 15012
Range 15012
Interquartile range (IQR) 80

Descriptive statistics

Standard deviation 184.5456879
Coefficient of variation (CV) 2.030399447
Kurtosis 517.0312719
Mean 90.89132103
Median Absolute Deviation (MAD) 27
Skewness 10.91742919
Sum 8645128
Variance 34057.11094
Monotonicity Not monotonic
Histogram with fixed size bins (bins=50)
Value Count Frequency (%)  
0 9336 9.8%
 
1 2569 2.7%
 
2 2145 2.3%
 
4 2010 2.1%
 
3 2005 2.1%
 
5 1772 1.9%
 
6 1700 1.8%
 
7 1584 1.7%
 
8 1496 1.6%
 
9 1424 1.5%
 
Other values (1360) 69074 72.6%
 
Value Count Frequency (%)  
0 9336 9.8%
 
1 2569 2.7%
 
2 2145 2.3%
 
3 2005 2.1%
 
4 2010 2.1%
 
Value Count Frequency (%)  
15012 1 < 0.1%
 
6345 1 < 0.1%
 
5590 1 < 0.1%
 
4521 1 < 0.1%
 
4477 1 < 0.1%
 

MESSAGE_COUNT
Real number (ℝ≥0)

Distinct 83
Distinct (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 19.84733218
Minimum 0
Maximum 89
Zeros 622
Zeros (%) 0.7%
Memory size 743.1 KiB

Quantile statistics

Minimum 0
5-th percentile 5
Q1 12
median 19
Q3 26
95-th percentile 39
Maximum 89
Range 89
Interquartile range (IQR) 14

Descriptive statistics

Standard deviation 10.64372964
Coefficient of variation (CV) 0.536280118
Kurtosis 0.5416944003
Mean 19.84733218
Median Absolute Deviation (MAD) 7
Skewness 0.652979402
Sum 1887779
Variance 113.2889807
Monotonicity Not monotonic
Histogram with fixed size bins (bins=50)
Value Count Frequency (%)  
16 3720 3.9%
 
18 3600 3.8%
 
17 3584 3.8%
 
15 3536 3.7%
 
14 3529 3.7%
 
13 3519 3.7%
 
19 3504 3.7%
 
20 3478 3.7%
 
12 3382 3.6%
 
11 3208 3.4%
 
Other values (73) 60055 63.1%
 
Value Count Frequency (%)  
0 622 0.7%
 
1 751 0.8%
 
2 902 0.9%
 
3 1022 1.1%
 
4 1374 1.4%
 
Value Count Frequency (%)  
89 2 < 0.1%
 
86 1 < 0.1%
 
84 1 < 0.1%
 
80 3 < 0.1%
 
79 1 < 0.1%
 

VIRTUAL_EVENT_ENGAGEMENT_COUNT
Real number (ℝ≥0)

ZEROS

Distinct 5
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 0.009378121222
Minimum 0
Maximum 4
Zeros 94294
Zeros (%) 99.1%
Memory size 743.1 KiB

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
median 0
Q3 0
95-th percentile 0
Maximum 4
Range 4
Interquartile range (IQR) 0

Descriptive statistics

Standard deviation 0.1049500664
Coefficient of variation (CV) 11.19094794
Kurtosis 208.6344267
Mean 0.009378121222
Median Absolute Deviation (MAD) 0
Skewness 12.97394872
Sum 892
Variance 0.01101451643
Monotonicity Not monotonic
Histogram with fixed size bins (bins=5)
Value Count Frequency (%)  
0 94294 99.1%
 
1 760 0.8%
 
2 52 0.1%
 
3 8 < 0.1%
 
4 1 < 0.1%
 
Value Count Frequency (%)  
0 94294 99.1%
 
1 760 0.8%
 
2 52 0.1%
 
3 8 < 0.1%
 
4 1 < 0.1%
 
Value Count Frequency (%)  
4 1 < 0.1%
 
3 8 < 0.1%
 
2 52 0.1%
 
1 760 0.8%
 
0 94294 99.1%
 

ON_CAMPUS_ENGAGEMENT_COUNT
Real number (ℝ≥0)

ZEROS

Distinct 7
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 0.3088892393
Minimum 0
Maximum 6
Zeros 67817
Zeros (%) 71.3%
Memory size 743.1 KiB

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
median 0
Q3 1
95-th percentile 1
Maximum 6
Range 6
Interquartile range (IQR) 1

Descriptive statistics

Standard deviation 0.5127107188
Coefficient of variation (CV) 1.659852962
Kurtosis 2.543493337
Mean 0.3088892393
Median Absolute Deviation (MAD) 0
Skewness 1.517398645
Sum 29380
Variance 0.2628722811
Monotonicity Not monotonic
Histogram with fixed size bins (bins=7)
Value Count Frequency (%)  
0 67817 71.3%
 
1 25436 26.7%
 
2 1679 1.8%
 
3 155 0.2%
 
4 20 < 0.1%
 
5 7 < 0.1%
 
6 1 < 0.1%
 
Value Count Frequency (%)  
0 67817 71.3%
 
1 25436 26.7%
 
2 1679 1.8%
 
3 155 0.2%
 
4 20 < 0.1%
 
Value Count Frequency (%)  
6 1 < 0.1%
 
5 7 < 0.1%
 
4 20 < 0.1%
 
3 155 0.2%
 
2 1679 1.8%
 

MOST_RECENT_ON_CAMPUS_EVENT
Categorical

HIGH CARDINALITY
MISSING

Distinct 82
Distinct (%) 0.3%
Missing 67817
Missing (%) 71.3%
Memory size 743.1 KiB
Information Session & Campus Tour
19421 
Information Session & Campus Tours
 
1164
Explore CU Boulder
 
998
Explore CU
 
729
Be Boulder for a Day
 
659
Other values (77)
4327 
Value Count Frequency (%)  
Information Session & Campus Tour 19421 20.4%
 
Information Session & Campus Tours 1164 1.2%
 
Explore CU Boulder 998 1.0%
 
Explore CU 729 0.8%
 
Be Boulder for a Day 659 0.7%
 
Talented Scholars Day 511 0.5%
 
Engineer Your Future: Engineering Sampler 461 0.5%
 
Discover CU Boulder 308 0.3%
 
Ralphie's Group Visit 299 0.3%
 
Summer Sampler 209 0.2%
 
Other values (72) 2539 2.7%
 
(Missing) 67817 71.3%
 
Frequencies of value counts

Unique

Unique 6
Unique (%) < 0.1%
Distinct 1557
Distinct (%) 5.7%
Missing 67817
Missing (%) 71.3%
Memory size 743.1 KiB
Minimum 2017-05-06 10:30:00
Maximum 2020-03-16 10:00:00
Histogram with fixed size bins (bins=50)

OFF_CAMPUS_ENGAGEMENT_COUNT
Real number (ℝ≥0)

ZEROS

Distinct 5
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 0.1372443884
Minimum 0
Maximum 4
Zeros 82972
Zeros (%) 87.2%
Memory size 743.1 KiB

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
median 0
Q3 0
95-th percentile 1
Maximum 4
Range 4
Interquartile range (IQR) 0

Descriptive statistics

Standard deviation 0.3725952432
Coefficient of variation (CV) 2.71483044
Kurtosis 7.946916358
Mean 0.1372443884
Median Absolute Deviation (MAD) 0
Skewness 2.760564873
Sum 13054
Variance 0.1388272153
Monotonicity Not monotonic
Histogram with fixed size bins (bins=5)
Value Count Frequency (%)  
0 82972 87.2%
 
1 11288 11.9%
 
2 803 0.8%
 
3 48 0.1%
 
4 4 < 0.1%
 
Value Count Frequency (%)  
0 82972 87.2%
 
1 11288 11.9%
 
2 803 0.8%
 
3 48 0.1%
 
4 4 < 0.1%
 
Value Count Frequency (%)  
4 4 < 0.1%
 
3 48 0.1%
 
2 803 0.8%
 
1 11288 11.9%
 
0 82972 87.2%
 

DEPOSIT_PAID
Boolean

Distinct 2
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 743.1 KiB
No
75265 
Yes
19850 
Value Count Frequency (%)  
No 75265 79.1%
 
Yes 19850 20.9%
 
Distinct 12438
Distinct (%) 52.7%
Missing 71523
Missing (%) 75.2%
Memory size 743.1 KiB
Minimum 2017-11-05 17:21:10
Maximum 2020-09-04 03:49:35
Histogram with fixed size bins (bins=50)

CURRENT_BIN_NAME
Categorical

MISSING

Distinct 4
Distinct (%) < 0.1%
Missing 57359
Missing (%) 60.3%
Memory size 743.1 KiB
R - Admit
25830 
U - Withdraw
10618 
T - Future Cycle
 
1307
S - Deny
 
1
Value Count Frequency (%)  
R - Admit 25830 27.2%
 
U - Withdraw 10618 11.2%
 
T - Future Cycle 1307 1.4%
 
S - Deny 1 < 0.1%
 
(Missing) 57359 60.3%
 
Frequencies of value counts

Unique

Unique 1
Unique (%) < 0.1%
Distinct 6
Distinct (%) 42.9%
Missing 95101
Missing (%) > 99.9%
Memory size 743.1 KiB
G. Bachelor's Level Degree
C. High School Graduate or Equivalent
H. Some Graduate School
D. Some College
B. Less Than High School Graduate
Value Count Frequency (%)  
G. Bachelor's Level Degree 5 < 0.1%
 
C. High School Graduate or Equivalent 3 < 0.1%
 
H. Some Graduate School 2 < 0.1%
 
D. Some College 2 < 0.1%
 
B. Less Than High School Graduate 1 < 0.1%
 
A. Not Indicated 1 < 0.1%
 
(Missing) 95101 > 99.9%
 
Frequencies of value counts

Unique

Unique 2
Unique (%) 14.3%
Distinct 1
Distinct (%) 100.0%
Missing 95114
Missing (%) > 99.9%
Memory size 743.1 KiB
0
 
1
(Missing)
95114 
Value Count Frequency (%)  
0 1 < 0.1%
 
(Missing) 95114 > 99.9%
 
Distinct 1
Distinct (%) 100.0%
Missing 95114
Missing (%) > 99.9%
Memory size 743.1 KiB
0
 
1
(Missing)
95114 
Value Count Frequency (%)  
0 1 < 0.1%
 
(Missing) 95114 > 99.9%
 
Distinct 8
Distinct (%) 7.7%
Missing 95011
Missing (%) 99.9%
Memory size 743.1 KiB
D. Some College
35 
B. Less Than High School Graduate
25 
G. Bachelor's Level Degree
25 
C. High School Graduate or Equivalent
10 
H. Some Graduate School
 
3
Other values (3)
Value Count Frequency (%)  
D. Some College 35 < 0.1%
 
B. Less Than High School Graduate 25 < 0.1%
 
G. Bachelor's Level Degree 25 < 0.1%
 
C. High School Graduate or Equivalent 10 < 0.1%
 
H. Some Graduate School 3 < 0.1%
 
I. Master's Level Degree 3 < 0.1%
 
A. Not Indicated 2 < 0.1%
 
J. Doctorate (Academic) 1 < 0.1%
 
(Missing) 95011 99.9%
 
Frequencies of value counts

Unique

Unique 1
Unique (%) 1.0%
Distinct 2
Distinct (%) 1.8%
Missing 95002
Missing (%) 99.9%
Memory size 743.1 KiB
0
 
107
1
 
6
(Missing)
95002 
Value Count Frequency (%)  
0 107 0.1%
 
1 6 < 0.1%
 
(Missing) 95002 99.9%
 
Distinct 2
Distinct (%) 1.6%
Missing 94992
Missing (%) 99.9%
Memory size 743.1 KiB
0
 
76
1
 
47
(Missing)
94992 
Value Count Frequency (%)  
0 76 0.1%
 
1 47 < 0.1%
 
(Missing) 94992 99.9%
 
Distinct 8
Distinct (%) 38.1%
Missing 95094
Missing (%) > 99.9%
Memory size 743.1 KiB
G. Bachelor's Level Degree
I. Master's Level Degree
K. Doctorate (Professional)
B. Less Than High School Graduate
F. 2-Year College Degree
Other values (3)
Value Count Frequency (%)  
G. Bachelor's Level Degree 8 < 0.1%
 
I. Master's Level Degree 4 < 0.1%
 
K. Doctorate (Professional) 3 < 0.1%
 
B. Less Than High School Graduate 2 < 0.1%
 
F. 2-Year College Degree 1 < 0.1%
 
E. Technical School 1 < 0.1%
 
D. Some College 1 < 0.1%
 
C. High School Graduate or Equivalent 1 < 0.1%
 
(Missing) 95094 > 99.9%
 
Frequencies of value counts

Unique

Unique 4
Unique (%) 19.0%
Distinct 2
Distinct (%) 9.5%
Missing 95094
Missing (%) > 99.9%
Memory size 743.1 KiB
0
 
18
1
 
3
(Missing)
95094 
Value Count Frequency (%)  
0 18 < 0.1%
 
1 3 < 0.1%
 
(Missing) 95094 > 99.9%
 
Distinct 2
Distinct (%) 9.5%
Missing 95094
Missing (%) > 99.9%
Memory size 743.1 KiB
0
 
20
1
 
1
(Missing)
95094 
Value Count Frequency (%)  
0 20 < 0.1%
 
1 1 < 0.1%
 
(Missing) 95094 > 99.9%
 
Distinct 5
Distinct (%) 50.0%
Missing 95105
Missing (%) > 99.9%
Memory size 743.1 KiB
G. Bachelor's Level Degree
H. Some Graduate School
J. Doctorate (Academic)
A. Not Indicated
K. Doctorate (Professional)
Value Count Frequency (%)  
G. Bachelor's Level Degree 4 < 0.1%
 
H. Some Graduate School 2 < 0.1%
 
J. Doctorate (Academic) 2 < 0.1%
 
A. Not Indicated 1 < 0.1%
 
K. Doctorate (Professional) 1 < 0.1%
 
(Missing) 95105 > 99.9%
 
Frequencies of value counts

Unique

Unique 2
Unique (%) 20.0%
Distinct 1
Distinct (%) 11.1%
Missing 95106
Missing (%) > 99.9%
Memory size 743.1 KiB
0
 
9
(Missing)
95106 
Value Count Frequency (%)  
0 9 < 0.1%
 
(Missing) 95106 > 99.9%
 
Distinct 2
Distinct (%) 20.0%
Missing 95105
Missing (%) > 99.9%
Memory size 743.1 KiB
1
 
7
0
 
3
(Missing)
95105 
Value Count Frequency (%)  
1 7 < 0.1%
 
0 3 < 0.1%
 
(Missing) 95105 > 99.9%
 
Distinct 10
Distinct (%) 0.1%
Missing 83799
Missing (%) 88.1%
Memory size 743.1 KiB
Colorado Free Application Day (Colorado residents only)
9666 
Other pre-approved application fee waiver (must have valid code)
 
458
CU Boulder application workshop participant
 
395
CU Boulder Legacy applicant (invitation only)
 
223
Pre-Collegiate Program participant (CO residents, by invitation only)
 
193
Other values (5)
 
381
Value Count Frequency (%)  
Colorado Free Application Day (Colorado residents only) 9666 10.2%
 
Other pre-approved application fee waiver (must have valid code) 458 0.5%
 
CU Boulder application workshop participant 395 0.4%
 
CU Boulder Legacy applicant (invitation only) 223 0.2%
 
Pre-Collegiate Program participant (CO residents, by invitation only) 193 0.2%
 
Outstanding Colorado Student (by invitation only) 132 0.1%
 
Active Duty or Veteran military personnel 126 0.1%
 
EngiNearMe Program participant (by invitation only) 54 0.1%
 
Boettcher Scholar semi-finalist (CO residents, by invitiation only) 49 0.1%
 
Pre-approved athletic waiver 20 < 0.1%
 
(Missing) 83799 88.1%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%
Distinct 2
Distinct (%) 5.4%
Missing 95078
Missing (%) > 99.9%
Memory size 743.1 KiB
1
 
36
0
 
1
(Missing)
95078 
Value Count Frequency (%)  
1 36 < 0.1%
 
0 1 < 0.1%
 
(Missing) 95078 > 99.9%
 
Distinct 2
Distinct (%) 1.4%
Missing 94977
Missing (%) 99.9%
Memory size 743.1 KiB
1
 
137
0
 
1
(Missing)
94977 
Value Count Frequency (%)  
1 137 0.1%
 
0 1 < 0.1%
 
(Missing) 94977 99.9%
 
Distinct 2
Distinct (%) < 0.1%
Missing 67504
Missing (%) 71.0%
Memory size 743.1 KiB
0
26817 
1
 
794
(Missing)
67504 
Value Count Frequency (%)  
0 26817 28.2%
 
1 794 0.8%
 
(Missing) 67504 71.0%
 

PERSON_YEARS_LIVED_IN_U_S
Categorical

MISSING

Distinct 3
Distinct (%) 0.2%
Missing 93294
Missing (%) 98.1%
Memory size 743.1 KiB
0
1369 
<1
378 
>20
 
74
Value Count Frequency (%)  
0 1369 1.4%
 
<1 378 0.4%
 
>20 74 0.1%
 
(Missing) 93294 98.1%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

PERSON_YEARS_LIVED_OUTSIDE_U_S
Categorical

MISSING

Distinct 3
Distinct (%) < 0.1%
Missing 34832
Missing (%) 36.6%
Memory size 743.1 KiB
0
58098 
<1
 
2171
>20
 
14
Value Count Frequency (%)  
0 58098 61.1%
 
<1 2171 2.3%
 
>20 14 < 0.1%
 
(Missing) 34832 36.6%
 
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%
Distinct 1
Distinct (%) 0.5%
Missing 94896
Missing (%) 99.8%
Memory size 743.1 KiB
1
 
219
(Missing)
94896 
Value Count Frequency (%)  
1 219 0.2%
 
(Missing) 94896 99.8%
 
Distinct 1
Distinct (%) 0.1%
Missing 94038
Missing (%) 98.9%
Memory size 743.1 KiB
1
 
1077
(Missing)
94038 
Value Count Frequency (%)  
1 1077 1.1%
 
(Missing) 94038 98.9%
 

APPLICATION_BAKER_SCHOLAR
Boolean

MISSING

Distinct 1
Distinct (%) < 0.1%
Missing 90763
Missing (%) 95.4%
Memory size 743.1 KiB
1
 
4352
(Missing)
90763 
Value Count Frequency (%)  
1 4352 4.6%
 
(Missing) 90763 95.4%
 
Distinct 2
Distinct (%) < 0.1%
Missing 71674
Missing (%) 75.4%
Memory size 743.1 KiB
1
23440 
0
 
1
(Missing)
71674 
Value Count Frequency (%)  
1 23440 24.6%
 
0 1 < 0.1%
 
(Missing) 71674 75.4%
 

APPLICATION_DEAN_SCHOLAR
Boolean

MISSING

Distinct 1
Distinct (%) 0.1%
Missing 93802
Missing (%) 98.6%
Memory size 743.1 KiB
1
 
1313
(Missing)
93802 
Value Count Frequency (%)  
1 1313 1.4%
 
(Missing) 93802 98.6%
 

APPLICATION_HALE_SCHOLAR
Boolean

MISSING

Distinct 2
Distinct (%) 0.1%
Missing 93063
Missing (%) 97.8%
Memory size 743.1 KiB
Value Count Frequency (%)
1 2051 2.2%
0 1 < 0.1%
(Missing) 93063 97.8%
Distinct 1
Distinct (%) 0.2%
Missing 94703
Missing (%) 99.6%
Memory size 743.1 KiB
Value Count Frequency (%)
1 412 0.4%
(Missing) 94703 99.6%
Distinct 1
Distinct (%) 0.4%
Missing 94868
Missing (%) 99.7%
Memory size 743.1 KiB
Value Count Frequency (%)
1 247 0.3%
(Missing) 94868 99.7%
Distinct 1
Distinct (%) 0.4%
Missing 94885
Missing (%) 99.8%
Memory size 743.1 KiB
Value Count Frequency (%)
1 230 0.2%
(Missing) 94885 99.8%
Distinct 1
Distinct (%) 100.0%
Missing 95114
Missing (%) > 99.9%
Memory size 743.1 KiB
Value Count Frequency (%)
0 1 < 0.1%
(Missing) 95114 > 99.9%
Distinct 2
Distinct (%) 0.1%
Missing 91753
Missing (%) 96.5%
Memory size 743.1 KiB
Value Count Frequency (%)
1 3361 3.5%
0 1 < 0.1%
(Missing) 91753 96.5%
Distinct 1
Distinct (%) 0.1%
Missing 94131
Missing (%) 99.0%
Memory size 743.1 KiB
Value Count Frequency (%)
1 984 1.0%
(Missing) 94131 99.0%
Distinct 1
Distinct (%) < 0.1%
Missing 92198
Missing (%) 96.9%
Memory size 743.1 KiB
Value Count Frequency (%)
1 2917 3.1%
(Missing) 92198 96.9%

APPLICATION_EXCEL
Categorical

MISSING

Distinct 4
Distinct (%) 0.3%
Missing 93729
Missing (%) 98.5%
Memory size 743.1 KiB
Value Count Frequency (%)
Excel referral 690 0.7%
Excel interview invite 461 0.5%
Excel admit 222 0.2%
Excel deny 13 < 0.1%
(Missing) 93729 98.5%
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%
Distinct 2
Distinct (%) < 0.1%
Missing 67510
Missing (%) 71.0%
Memory size 743.1 KiB
Value Count Frequency (%)
0 19294 20.3%
1 8311 8.7%
(Missing) 67510 71.0%
Distinct 1
Distinct (%) 0.1%
Missing 93820
Missing (%) 98.6%
Memory size 743.1 KiB
Value Count Frequency (%)
1 1295 1.4%
(Missing) 93820 98.6%
Distinct 1
Distinct (%) 0.4%
Missing 94860
Missing (%) 99.7%
Memory size 743.1 KiB
Value Count Frequency (%)
1 255 0.3%
(Missing) 94860 99.7%
Distinct 1
Distinct (%) 4.2%
Missing 95091
Missing (%) > 99.9%
Memory size 743.1 KiB
Value Count Frequency (%)
1 24 < 0.1%
(Missing) 95091 > 99.9%
Distinct 1
Distinct (%) 0.7%
Missing 94968
Missing (%) 99.8%
Memory size 743.1 KiB
Value Count Frequency (%)
1 147 0.2%
(Missing) 94968 99.8%
Distinct 1
Distinct (%) 0.5%
Missing 94898
Missing (%) 99.8%
Memory size 743.1 KiB
Value Count Frequency (%)
1 217 0.2%
(Missing) 94898 99.8%

PERSON_DANIEL_S_SCHOLAR
Boolean

MISSING

Distinct 1
Distinct (%) 0.5%
Missing 94925
Missing (%) 99.8%
Memory size 743.1 KiB
Value Count Frequency (%)
1 190 0.2%
(Missing) 94925 99.8%

PERSON_ENGINEARME
Boolean

MISSING

Distinct 1
Distinct (%) 3.4%
Missing 95086
Missing (%) > 99.9%
Memory size 743.1 KiB
Value Count Frequency (%)
1 29 < 0.1%
(Missing) 95086 > 99.9%

HS_CITY_LOCATION
Categorical

HIGH CARDINALITY
MISSING

Distinct 3170
Distinct (%) 4.6%
Missing 25798
Missing (%) 27.1%
Memory size 743.1 KiB
Value Count Frequency (%)
Denver 2525 2.7%
Highlands Ranch 1786 1.9%
Aurora 1674 1.8%
Boulder 1572 1.7%
Colorado Springs 1433 1.5%
Littleton 1036 1.1%
Greenwood Village 1010 1.1%
Fort Collins 953 1.0%
Broomfield 750 0.8%
Austin 739 0.8%
Other values (3160) 55839 58.7%
(Missing) 25798 27.1%
Frequencies of value counts

Unique

Unique 882
Unique (%) 1.3%

HS_TOT_ENROLLMENT
Real number (ℝ≥0)

MISSING

Distinct 2588
Distinct (%) 3.1%
Missing 12240
Missing (%) 12.9%
Infinite 0
Infinite (%) 0.0%
Mean 1667.449979
Minimum 0
Maximum 5934
Zeros 7
Zeros (%) < 0.1%
Memory size 743.1 KiB

Quantile statistics

Minimum 0
5-th percentile 402
Q1 1058
median 1641
Q3 2179
95-th percentile 3154
Maximum 5934
Range 5934
Interquartile range (IQR) 1121

Descriptive statistics

Standard deviation 843.437308
Coefficient of variation (CV) 0.5058246536
Kurtosis 0.6205936043
Mean 1667.449979
Median Absolute Deviation (MAD) 558
Skewness 0.5771017335
Sum 138189917
Variance 711386.4925
Monotonicity Not monotonic
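
Several of the HS_TOT_ENROLLMENT statistics above are derived from one another, so they can be cross-checked with simple arithmetic. The sketch below is illustrative only: it copies the reported values by hand and uses the standard definitions (IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared, mean = sum / non-missing count). The variable names are hypothetical.

```python
import math

# Values copied from the HS_TOT_ENROLLMENT summary above
n_total, n_missing = 95115, 12240
q1, q3 = 1058, 2179
mean, std = 1667.449979, 843.437308
total = 138189917

iqr = q3 - q1                     # interquartile range
cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance from the standard deviation
n_present = n_total - n_missing   # rows that actually have a value
implied_mean = total / n_present  # mean recomputed from the sum

print(iqr)           # 1121
print(round(cv, 4))  # 0.5058
print(n_present)     # 82875
print(math.isclose(implied_mean, mean, rel_tol=1e-6))  # True
```

The recomputed mean matching the reported one suggests that Mean and Sum are taken over the 82875 non-missing rows rather than over all observations.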
Histogram with fixed size bins (bins=50)
Common values
Value Count Frequency (%)
3720 1023 1.1%
2179 986 1.0%
2266 757 0.8%
2603 708 0.7%
1684 680 0.7%
2103 652 0.7%
1712 562 0.6%
2332 539 0.6%
2250 504 0.5%
2091 430 0.5%
Other values (2578) 76034 79.9%
(Missing) 12240 12.9%

Minimum 5 values
Value Count Frequency (%)
0 7 < 0.1%
3 3 < 0.1%
9 1 < 0.1%
12 2 < 0.1%
14 3 < 0.1%

Maximum 5 values
Value Count Frequency (%)
5934 45 < 0.1%
5286 66 0.1%
5098 47 < 0.1%
4797 31 < 0.1%
4609 3 < 0.1%

HS_NCES_ID
Categorical

HIGH CARDINALITY
MISSING

Distinct 6674
Distinct (%) 8.1%
Missing 12234
Missing (%) 12.9%
Memory size 743.1 KiB
Value Count Frequency (%)
080291000186 1010 1.1%
080249000114 896 0.9%
080336000338 708 0.7%
A9101574 657 0.7%
080249000101 634 0.7%
080345001961 592 0.6%
080249001632 557 0.6%
080345001748 533 0.6%
080531000873 498 0.5%
080690001784 408 0.4%
Other values (6664) 76388 80.3%
(Missing) 12234 12.9%
Frequencies of value counts

Unique

Unique 2095
Unique (%) 2.5%

HS_NAME
Categorical

HIGH CARDINALITY
MISSING

Distinct 6179
Distinct (%) 7.5%
Missing 12234
Missing (%) 12.9%
Memory size 743.1 KiB
Value Count Frequency (%)
Cherry Creek High School 1010 1.1%
Fairview High School 897 0.9%
East High School 770 0.8%
Regis Jesuit High School 657 0.7%
Boulder High School 634 0.7%
Rock Canyon High School 592 0.6%
Monarch High School 558 0.6%
Mountain Vista High School 533 0.6%
Arapahoe High School 498 0.5%
Legacy High School 408 0.4%
Other values (6169) 76324 80.2%
(Missing) 12234 12.9%
Frequencies of value counts

Unique

Unique 1861
Unique (%) 2.2%

HS_TYPE
Categorical

MISSING

Distinct 7
Distinct (%) < 0.1%
Missing 12234
Missing (%) 12.9%
Memory size 743.1 KiB
Value Count Frequency (%)
1 81407 85.6%
3 784 0.8%
4 446 0.5%
6 182 0.2%
2 43 < 0.1%
7 17 < 0.1%
5 2 < 0.1%
(Missing) 12234 12.9%
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

HS_STATE
Categorical

HIGH CARDINALITY
MISSING

Distinct 52
Distinct (%) 0.1%
Missing 25798
Missing (%) 27.1%
Memory size 743.1 KiB
Value Count Frequency (%)
CO 23371 24.6%
CA 12837 13.5%
IL 4358 4.6%
TX 3490 3.7%
WA 2058 2.2%
MA 1952 2.1%
NJ 1829 1.9%
NY 1707 1.8%
PA 1235 1.3%
MD 1226 1.3%
Other values (42) 15254 16.0%
(Missing) 25798 27.1%
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

HS_TEACHERS_FTE
Real number (ℝ≥0)

MISSING

Distinct 248
Distinct (%) 0.3%
Missing 12403
Missing (%) 13.0%
Infinite 0
Infinite (%) 0.0%
Mean 94.54786488
Minimum 0
Maximum 1050
Zeros 45
Zeros (%) < 0.1%
Memory size 743.1 KiB

Quantile statistics

Minimum 0
5-th percentile 29
Q1 65
median 90
Q3 114
95-th percentile 181
Maximum 1050
Range 1050
Interquartile range (IQR) 49

Descriptive statistics

Standard deviation 46.06830286
Coefficient of variation (CV) 0.4872484738
Kurtosis 6.55545907
Mean 94.54786488
Median Absolute Deviation (MAD) 24
Skewness 1.272606028
Sum 7820243
Variance 2122.288529
Monotonicity Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value Count Frequency (%)
97 1893 2.0%
107 1675 1.8%
100 1479 1.6%
104 1469 1.5%
80 1235 1.3%
103 1198 1.3%
75 1188 1.2%
90 1145 1.2%
181 1132 1.2%
87 1105 1.2%
Other values (238) 69193 72.7%
(Missing) 12403 13.0%

Minimum 5 values
Value Count Frequency (%)
0 45 < 0.1%
1 51 0.1%
2 96 0.1%
3 41 < 0.1%
4 18 < 0.1%

Maximum 5 values
Value Count Frequency (%)
1050 2 < 0.1%
362 16 < 0.1%
330 1 < 0.1%
327 17 < 0.1%
310 24 < 0.1%

HS_URBAN_CENTRIC_LOCALE_CODE
Categorical

MISSING

Distinct 12
Distinct (%) < 0.1%
Missing 12234
Missing (%) 12.9%
Memory size 743.1 KiB
Value Count Frequency (%)
21 36300 38.2%
11 16811 17.7%
12 7601 8.0%
13 7048 7.4%
41 6845 7.2%
22 1980 2.1%
33 1707 1.8%
23 1161 1.2%
31 1051 1.1%
42 981 1.0%
Other values (2) 1396 1.5%
(Missing) 12234 12.9%
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

HS_ZIP
Categorical

HIGH CARDINALITY
MISSING

Distinct 4880
Distinct (%) 7.0%
Missing 25798
Missing (%) 27.1%
Memory size 743.1 KiB
Value Count Frequency (%)
80111 1010 1.1%
80305 896 0.9%
80016 776 0.8%
80020 750 0.8%
80206 708 0.7%
80302 663 0.7%
80503 613 0.6%
80124 592 0.6%
80027 557 0.6%
80138 547 0.6%
Other values (4870) 62205 65.4%
(Missing) 25798 27.1%
Frequencies of value counts

Unique

Unique 1564
Unique (%) 2.3%

HS_ADDRESS
Categorical

HIGH CARDINALITY
MISSING

Distinct 5482
Distinct (%) 7.9%
Missing 25798
Missing (%) 27.1%
Memory size 743.1 KiB
Value Count Frequency (%)
9300 East Union Avenue 1010 1.1%
1515 Greenbriar Boulevard 896 0.9%
1600 City Park Esplanade 708 0.7%
1604 Arapahoe Avenue 634 0.7%
5810 Mcarthur Ranch Road 592 0.6%
329 Campus Drive 557 0.6%
10585 Mountain Vista Ridge 533 0.6%
2201 East Dry Creek Road 498 0.5%
2701 West 136Th Street 408 0.4%
25901 Arapahoe Parkway 391 0.4%
Other values (5472) 63090 66.3%
(Missing) 25798 27.1%
Frequencies of value counts

Unique

Unique 1798
Unique (%) 2.6%

HS_TYPE_VALUE
Categorical

MISSING

Distinct 11
Distinct (%) < 0.1%
Missing 12234
Missing (%) 12.9%
Memory size 743.1 KiB
Regular School 67337
Regular Elementary Or Secondary 14070
Special Program Emphasis 674
Other/Alternative School 418
Alternative/Other 182
Other values (6) 200
Value Count Frequency (%)
Regular School 67337 70.8%
Regular Elementary Or Secondary 14070 14.8%
Special Program Emphasis 674 0.7%
Other/Alternative School 418 0.4%
Alternative/Other 182 0.2%
Vocational School 110 0.1%
Montessori 40 < 0.1%
Special Education 28 < 0.1%
Early Childhood Program/Child Care Center 17 < 0.1%
Special Education School 3 < 0.1%
(Missing) 12234 12.9%
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

HS_CLASSIFICATION
Categorical

MISSING

Distinct 2
Distinct (%) < 0.1%
Missing 12234
Missing (%) 12.9%
Memory size 743.1 KiB
Value Count Frequency (%)
Public 67868 71.4%
Private 15013 15.8%
(Missing) 12234 12.9%
Frequencies of value counts

Unique

Unique 0
Unique (%) 0.0%

HS_CEEB
Categorical

HIGH CARDINALITY
MISSING

Distinct 6691
Distinct (%) 8.1%
Missing 12234
Missing (%) 12.9%
Memory size 743.1 KiB
Value Count Frequency (%)
060515 1010 1.1%
060118 896 0.9%
060400 708 0.7%
060115 634 0.7%
060748 592 0.6%
060130 557 0.6%
060747 533 0.6%
060928 498 0.5%
060163 408 0.4%
060086 391 0.4%
Other values (6681) 76654 80.6%
(Missing) 12234 12.9%
Frequencies of value counts

Unique

Unique 2100
Unique (%) 2.5%